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a _ AbStract 
\ ‘ 
ie : 4 “ond . 
\ 4 ‘ 
.\ ,fhis study was an attegpt to extend the Basic Word Vo-- 
3 a a © 


\ ais 


; a cabulary Test (BWVT) , developed by Dupuy (1974), downward ae a 
* ° \ : 
by administering it orally. Four questions were investi--.« 
gated: : -: | ; OE hlae, “Sat ee 


\ ‘ . os: s 
“1, How: large afte the aural understanding basic: word 
} vocabularies > children in ‘grades née through four? ° 

eae ‘Do significant differencés.exist among qrade-lev-" 
7 ‘pls? . % es : ; vi | ; | as ae 
; ae DO significant sittcrencks exist’ between the oral- ee 
A ie administered BWVT scores and phe: Peadine BWVT scores of, 
« . third and Fourth wade children? . | : 
4. What correlations exist, between the oral Reo ’ 


and IQ, ahd between oral scores and the Gates-MacGinitie 
ww E ey : 
Reading Test results? es , ‘ af , se 


For investigation of these questions, 96 students (24 


per grade level) were randomly selected from grades one = 
through four in a Central New Jersey township school. The 


4 oo 


5, BWVT was administered orally to all four grades; and as a 


fy 


reading test to third and fourth grades. 
The estimates of aural understanding vocabulary size 
were first grade: 1,780 Bante words; second grade: 2,330 
basic words; third grade: 3,780 basic words; and fourth | 
grade: 4,000 basic weuae>.. hese results are much lower . 
than results of previous mS 


aa 


2 : 


\ fire. gee 2 


wis . \A one way sii ale of variance and Scheffe! s tests on 
my , detuned, were per formed on ae Oral means. dave was a, a2 : é 


ee a \ main effect of grade (F + 32.06, f = 3/92, pes 01), ci 


\ oS 

? all pairwise Venpereees were cignificant at the .01l, level. 

pe eS Significant differences between the oral and the read- 
ing means for third and, foré fourth grade were verified by t 


A | Tasts. The t ahieaak 6.9 for third grade, and 6.3 for 
% fourth deade were both significant at the .001 level. 

te ' The correlation between th@ oral BWVT and 1) Gates- 

MacGinitie vocabulary was Pa i (p < .01), 3) Gates- 


MacGinitie comprehension , was .59, (p< .01), and 


3) Kuhlman-Anderson IQ was ,29, (p < .05). The low .29 


correlation may have been due to the IQ data being ofd. 


. 


s 


¢ \y 
: The experiment showed the feasibility of administering 


the Se oraliy. The low vocabulary size estimates are 
: \ 


e more in adoordance with the early "vocabulary size estimates 
4 | 


~ than with estimates dependent on the Seashoresfokerson 


test. This is due to Dupuy's strict criteria in defining a 


‘ 


basic word. \ 


,a f ik , oy 


/ 


\ ~ " ? : i A 
PRIMARY GRADE UNDERSTANDING VOCABULARY 


AS MEASURED BY ORALLY ADMINISTERED 4 


BASIC WORD VOCABULARY TEST 


ie \ 
A THESIS 
. SUBMITTED TO THE FACULTY = od 
OF “THE GRADUATE SCHOOL OF EDUCATION a 
or * > i 
ms , " o ;% 
RUTGERS Bi : 
THE STATE UNIVERSITY OF NEW JERSEY r 
‘ p 2 . BY 
- 
' MARIA L. BAYER 
2 | oe oR ; 
IN PARTIAL FULFILLMENT OF THE , || 
A ' REQUIREMENTS FOR THE DEGREE », a 
OF 

MASTER OF EDUCATION ~ y 

nel . 
NEW BRUNSWICK, NEW JERSEY . OCTOBER 1976 pe 
; - 
: 4h, 


a .. 


APRROVED:, 


EDWARD sia 
\ 


; 4 ° \ 


: MARJORIE ARNOLD ° 


; JOHN SANTA 


4 DEAN: 7 
Fo MILTON SCHWEBEL. 


ne 


af 


Ri 
; ‘ 
( 
1 , J : a 
ae ‘ ; ® 
ci , ‘ “ACKNOWLEDGMENTS . 
I would like to express my appreciation to Dr. Edward 
‘Fry, Director, Reading Center, for his encouragement and ¥ 


, Petpeul suggestions.during the préparation of this study. 


I would, also like to thank Dr. Marjorie Arnold for her , 


comments and questions which made the writing \easier, and 


Dr. John Santa for his advice on seme of the statistical \ 


ty 


writing. : | ; ‘qe ba 


Gratitude is expressed to Mr. Rubin Feldstein, princi- 


pal, Irene E. Feldkirchner school, and to all the teachers 


for théir cooperation in the testing procedure. 


I would like to especially thank my husband Douglas, 


| ’ 
and son David, for their help c support during the time I 


paid more attention to “the thesis" than to them. 


\ | ne 1 22 a 
= 


=! 


2) a 


gs 


TABLE OF CONTENTS 


, \ 


nF oe ‘ Page 


ACKNOWLERGEMENTS . 6 «© © es & ee 6 ee eH Uw 15 


pre oe TRREES: 3-60 oR ae oe ee ee v 
‘ \ 
Chapter i Bi 

Ey STATEMENT OF THE PROBLEM ....,. « © « 2 1 

Background of the Problem ........ 1 

Statement of the Problem it Ae ee el ee 5 

Importance of the Problem ........ 6 
Definition of terms . .. -. +s 2 + se @ 7 

Limitations of the Study... . 3° go 8 

, : 4 ‘i ‘ co 

Overview of the Study .. 2. 1% 1 oo y — 9 

4 7 e ’ ( 
of ’ q 
II. REVIEW OF THE tYTERATURE PS te. ge Se Goa we 10 
T¥en s ° e ° . e e e e ° ° e e e . e eS e 10 
ae ae : 

pi Methods . J e ° e J e cae e e e e ° . e ° 13 

Progleme oy 4g Soe ay wow awed 15 

: Definition of Rake Se a th es oak, Uh 15 

Definition of "know" it en eal an a 17 

" Definition of vocabulary ...+s+s.es LS 

Errors in sampling eet & wee ae ZO 

Neste: 4.49.04 aos EE “eos Re oe ae 


Summary . . a ee ree ae ee ners 


4az ' 
111 R 


_ Chapter 


TABLE OF CONTENTS (continued) 


4 . 


ee 


TIE, RROCEDURE 06 oa" id aw Sv S88 


eae” £ 


IV. 


Vv. 


BIBLIOGRAPHY veg ak, ees 


APPENDIX 


Population ws lb eres el ee oe, sat Sener oe 
trument ... .. - et 

fie Btuaye sn a ve) ae 

Design Sho (eh rue a wah neon tar ee ee 


Data Collection ... it. s «se «6 


Data Analysis ee ee eee 


. = . 
FINDINGS AND DISCUSSION. ..... 


Presentation of Data ee ae ae 
Significant Differences .... . 
Correlations ¢ ale SS 
Discussion . or ae oe 
SUMMARY AND CONCLUSIONS 1... . 
Summary oe ee ee ee ae 


Conclusions «sets & 6 ee ce ee 


Suggestions for Further Re 


A. Sample Basic Word 


VOCADULALY TESE*. fa aoe 6 ae Se ee 


« 


55 


g 


Table 


Li 


° 


’ 


LIST OF TABLES 


Some Estimates of English Vocabulary 


Size, by Method Used. ... 2. ae coe 


- 


Means\for Oral BWVT, Reading BWVT, 


4 


Vocabulary Size, Ages, IQ, and Gates- 


es ate Reading Test, by Grade Level . 
t 


\ 


Analysis of Variance Among Grade’ Levels . 


Aualysis of Mean Difference Among .Grade 


Levels using Scheffe's Tests 


4 


\ 


on 


Contrasts 


t Test Between Oral Administration and 


Reading Administration o€ the BWVT for 


Third and Fourth Grades .... Ms... .s 


’ 


s 


Coryelation Matrix.of Oral BWVT Raw Scores, 


Comprehension Grade Scores, and 


v 
Kuhlman-Anderson IQ Scores. . . 


‘ 


°’ Gates Vocabulary Grade Scores, Gates 


12 


36 


39 


40 oa’ ,r 


41 


CHAPTER 1] 


- THE PROBLEM 
4 ' 


: 4 . ” ; 
An important phase, of reading instruction is vocgbu- 


lary development. - A vast amount of literature exists on- 


‘hae to develop vocabulary and many investigators have at- 
tempted to measure’ the vocabulary aise of children and 
_ adults. However, when vocabulary size has been studied and 
‘results from ‘similar *groups compared, the results sre nok 
ee atemene Some vocabulary size estimates are 10 times 
the size of estimates made earlier (Lorge & Chall, 1963). - 

. Lorge and Chall (1963) concfuded that the reported vo- 
cabulary size variations were due to the differences inf 
meticdaisates used by. various investigators. Two major 
methods for estimpting poesbelary size are the sampling 
method and the use method. With thé-“sampling method, words 
can be systematically sampled fron a dictionary Or 
Thorndike's word counts. Usually ieee nth word = on- 
predetermined pages, or: every nth word in a list iw Batdate 
ed for testing. Another technique in samp] jag depends on 
using .a table of random numbers to locate the words tis 
ss ena for testing. The vocabulary size estimate is 


calculated by multiplying the sample number correct by the 


tatio of the source size to the sample size. (Suppose a 


i he : e. 


10 


v 


e * 


avy . \ é ; 


sample of .100 words was drawn, froma source containing 


= 


10,000 Words. A subject ‘might pget 80 of” the \100 sample 


, 


“words correct. The subject's vocabulary size would be es- 


timated at 8,000 words, since 80 X 10,000/100 \= 87000.) 


This method limits the total number of words that a person 


knows to the size of the source, and depends on ‘the persons 


‘recognizing (usually by reading) the words. Seashore and. 


EckerSon (1940) used the dictionary sampling method, ih 
determining the vocabulary size of college students. 
The use method depends on a person's producing (rather 


than recognizing). words either through speech or writing. 


-The vocabulary size is estimated by totaling the different 


words used. A limitation of this method is ‘that people 
wilt not use all the words they know in a given s¥tuation, 
as pointed out by Seashore and Eckerson coe ee One inves- 
tigator who used this method was Rinsland (1945). He es- 
timated children's vocabulary ,size based ‘on their writing. 
With both methods there are difficulties, such as the 


definition of a\"yord". Tf "bat" is counted as a word, how 


Lf 


‘is "bats" to be counted? One may argue that it is simply a 


A / : 
plural and should not be counted as an additional word, or 
; a = 
investigators may count it as a separate word. Another 
difficulty with counting a "word" arises when the meses 


are considered. Is "bat" to be counted as one-word or two 


words when a person knows it iS an animal as well as a 


\ F 
piece of baseball equipment? In just the unit of measure- 


”~ 


ment, there is a source of vocabulary size\variation,* due 
a ' 


11» ae 


: * 
, | t 
e 
e 
wv 
NS 
tae 


? # i ; 
to lack of agreement on the definition of "word". | 


Another difficulty wh the definition of sees Anete 


‘ | 


is a whole range of "know" from being able to define the 


word easily, to being uncertain, to not even knowing if the 
collection Les | is a word (Dale, 1965). ae mae be 
able to define "batted" the baths but is it as easy to de- 
fine "batted" an eye? Does "enervate" mean to increase or 
to decrease’strength? qhat does "“praedial" mean? One in- 
vestigator may require choosing one of three alternatives 
to define a word. ,Another may require the subject $0 pro- 
ducef a synonyit., Not only is the criteria for know" dif- 
‘ferent, introducing another .source of size waEtaelon, but 
there are two different vacabularies being testad. - . 
Thus, another problem area is the designation of voca- 
, bulary type. It may be general knowledge that people have 
different-sized vocabularies’ for speaking, writing, read- 
‘ 
ing, and aural understanding. But the designation of which 
specific vocabulary was icnaneea is-gften omitted in a gen- 
eral ‘statement like, "Pirst-graders know 12,000 words." 
There is quite a difference if this number refers to a 
speaking, a ‘reading,’ or a Writing vocabulary. Childsren, 
Particularly in primary grades, will have smaller writing 
vocabularies than understanding vocabulariés ‘because they 
are just’ learning to read and write. As people mature, the 
Wedabulary sizes.change with raspaee- te each other, .i.e 


—— oe . 
the redding vocabulary becomes larger than the speaking vo- 


j 
cabulary. One may understand "enervate" within its printed 


: = 12 


context, but, not use it in speaking. Obviously, measuring 


_vocabulary size is a task requiring several considerations. 


It was Lorge'and Chall's belief that different methodolo- 


r 


gies with the aforementioned difficulties caused the vari- 


ance in vocabulary size estimates. 


Similarly, Dupuy (1974,p.2) felt the inconsistent vo- 
4 ‘ Ms ‘ 
cabulary size results were due to the investigators' 


failure to define clearly four a&Spects of measurement: 


’ re . i : . 
(1) definition of the unit of. measurement--the word 
" > 


(2) estimates of the word population 


(3) basis for sampling, e.g., the size of the diction- 


ary or the nature of the use situation from which. 


the sampling for the test was jtaken, and 
(4) criteria used in determining word knowledge. a 
With these ‘points in mind, Dupuy constructed the Basic 
Word Vocabulary Test (BWVT). It measures both absolute vo- 
cabulary size (total number of words that an individual 
knows) and relative vocabulary size (size of the 
individual's vocabulary in relation to vocabulary size of 
other groups). 

Dupuy saveranea criteria for each aspect of! measure- 
ment. First, the estimate of the word population was taken 
from Webster's Third ele eebarbabionsl Dictionary. Main 
entries from one of three columns on every tenth pdye were 
counted. Only one of all similar homographs was counted; 


abbreviations were counted, but not affixes.@ The process 
2% 


yielded a rounded population estimate of 240,000 words, 


ii 13 ‘ 


JZ 


™~ 


from which a oS aes ssqole was taken. This sample was 
fur r reduced by Basic Words having to meet these re- 
quirements: they had to be main entries in-all four speci- 
fied major dictionaries, they had -to be single word forms- 
-not compounds, hyphenated forms, or abbreviatiens-- and 
they could not be designated as a technical, ae eae 


/ 
slang, or archaic word. 


< 


\. A-word was "known" when it was correctty chosen from 


five alternatives on 4 multiple choice reading test. 


Strict criteria were also used in the construction of the, 
correct.responses and the distractors for each item. Items’ 
os ; 


_were ranked according to difficulty datetinined in pretest- ° 


ial ns 
, 


g situations. 
he BWVT is a multiple choice test whose range is 
‘X 


third grade through Bee evel: A problem with the test 


~ aS that it is too difficult for first and second graders, 


Since they lack the necessary decoding skills to read the 


test items! How can their vocabulary be assessed then? . In” 


discussing testing the word knowledge of those with limited 


reading ability, Lorge and Chall (1963, p. 149) state that 
",..the, problem may be avoided by administering the test 


orally." 


4 


Statement of the Problem 


The present study is designed as a downward extension 


\ 


of the Basic Word Vocabulary Test by administering it oral- 


ly. The major questions to be studied are 


. 
« = : 


i4 


. 4 
(1) How large are the aural understanding basic word 
vocabularies of children in grades one through 
four? 7 
(2) od sunt bieane differences exist Baan grade lev- 
els? 
0 Z. ; 
(3) Do significant differences exist between the oral 
BWVT scores and the peneane BWVT scores of third 


9 


and fourth grade éhildren? 


(4) What "eoreelations exist DEEWE RN the oral scores 
and IQ, and between euais scores and the Gates- 


MacGinitie Reading Test results? 


Importance . 


The douneard extension (oral administration) of the. 
BWVT provides a needed eee? that measures both 1) the, abso- 
lute (cardinal number result) sad 2) the relative (ordinal 
value result) aural understanding vocabulary sizes of pri- 
Mary grade children. At present, there is nd other current. 
instrument which can give both types of measurement. 

It is also important to understand that the unit of 
measurement--the ee ee a very 5 aieeae defined 
ince There should be no edntueion as to whether cott> 
pounds, technical words, or variant forms are, included, 
thus inflating the ae re size. : 
The main purpose in. using the BWVT is-to gather accu- 

t 


rate baseline data on individuals. By assessing the abso-, 


lute and the relative size of a student's vocabulary, the 


15 


. 


examiner would gain knowledge about the individual, and how 
e v ; , 
he compares to the group... The individual's growth patterns 


could be monitored. _ 


As a diagnostic tool, the HVT should be “useful for 
indicating which dudividuate eel extra help in vocdbulary 
building. Teachers could pale these children for’ 'in- 
‘struction. “| ane ie 7 “ eh 

Group scores could provide a Sabie fare cecgnmendation 
in ‘curriculum change, e.g. should more stress be placed on 
vocabulary instruction? Which words should be included in 
primary réading lessons? | 
Definition of Térms _f? % 

Absolute vocabulary size. The actual sizé of one's 
vocabulary expressed in a cardinal number, e. g., 12,700 
words). \ 

Basic word. A word that meets Dupuy's five criteria 
of being 1) a fain entry in all four of these dictionaries: 
Book, and Funk and Wagnell's; 2) a single form, not com- 
pound or hyphenated; 3) not a proper name or abbreviation; 
4) not foreign, archaic, technical, or slang; and 5) not a 
derived or variant form. : | 

Relative vocabutary size. The size of one's vocabu- 
lary relative to another individual or group, expressed in 
a rank value, e.g. the 50th percentile. 


Vocabulary. The collection of words fot which people. 


s 


16 


oe. 


a 


have meaning, especially the words understood when people 


hear them. 


Word. When “word" is used alone, it refers to a lexi- 


cal entry in the dictionary. ‘ 
: Saas ; 
CLG \ 
Limitations { ' 


‘One of the problems in assessing vocabulary size a 


: determining when a word is known and ths counted in one's 
vocabulary. / In the. BWVT, examinees have to be able to. 
demonstrate only one meaning of the bacie gore by selective 
one of five choices. This procedure does not allow an ex- 
aminee to sia how ,many meanings of the basic ward he 
knows. Thus, the BWVT measures vocabulary breadth, not 
depth. It must also be ds ce that the BWVT, used in 
this way, measures aural understanding vocabulary size, not 
reading, speaking) or writing vocabulary sizes. | 

Another limitation of the present study is age 
of subjects oe She one & four grade range. Due 4% the 
time involved in testing small groups of four subjects in 
first and second grades, the population had to be restrict- 


ed. Only 24 subjects per. grade level , for A ae of 96 


subjects, were used. 


Data for IQ are above the mean and data for.reading * 


) | iit 
levels are above grade levels. This indicates the popula-' 
\ 4 


tion is above average, and the results should not be pro- 


*jected onto other groups. 
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fe 


Overview of the Study - é. 
Chapter Two is a ‘review of the literature on vocabu- 
lary studies. . The trends in research, different methods 


for i i aia eae size, and inhereart~ problems are 
i a 


included. ; ee, 


y 


Chapter PES is a description of the procedure for 


oe 


y Wer 
extending the BWVT downward and #or obtaining the aural 


basic word vocabulary size of primary grade children. 


- Descriptions of .the population, ;,Dupuy's, test, pilot test- 


ing, data collection@#end data analysis are included. 


. e ; 
Chapter Four is a presentation'of the data and discus- 


sion of the findings in relation to the literature. Data 


4 


include the means of the oral scores by grade level and an 


analysis of variance of the means, t tests between oral and 
reading scores of the BWVT by grade level, and a correla- 
tion matrix of the oral scores, Gates-MacGinitie Reading 
Test scores, and the Kuhlman-Anderson IQ test scores. 
Chapter Five is a Somme’ and a presentation of the 
conclusions of this study. Suggestions for further 


research are ineluded. 


. 


-_—A 


‘CHAPTER II 


Se 


REVIEW OF THE LITERATURE 
: f~ “ E E 
wee , : , 
This experiment was an attempt to extend the Basic 
Word Vocabulary Test (BWVT) downwasd shesugh oral adminis- 
tration. Given as a réading «test, the instrument adaeicae: 
reading basic word vocabulary size, but given orally, it 
assesses the autal waderekunding basic word vocabulary 
size. The ares of the iieeceeies includes the trends, 
methods, and problems of vocabulary size research, and com- 
ments on vocabulary tests. The trends dection states when 
most of the studies were done and what is recent in the 
area. The methods section describes dictionary and fre- 
quency sampling as. well as methods based on use. The prob- 
lems section deals with the difficulty in defining a word, 
in defining “know” and testing it, with the need to define 


the kind of vocabulary being tested, and lastly, with er- 


rors in sampling. 


Trends 

Most vocabulary size studies were done in_ the 
nineteen-twenties to the nineteen-fifties: Terman(1916), 
M.E. Smith (1926), SDolch (1936), Seashore & Eckerson 


(1940), M.K.Smith (1941), Rinsland (1945), and Shibles 


10 
19 
we 


11 


. 
¢ 


(1959). The early studies by Terman, M.E. Smith, and Dolch 
revealed primary grade children knew less chan 4000 words. 
The later naa by M.K. Smith and Shibles showed primary 
grade children knew 24, 000 words or more. Controversy over 
the estimated size variations has .continued for years. 
(Table 1 1S summary of the results of some previous stu- 
dies in vo@abulary size research.) Finally, Lorge and @hall 
(1963) analyzed the methods used and contended that the 
“reasons for vacabulary size discrepancies lie in ‘the pro- 
cedures used to estimate size. These procedures are ela- 
borated in the following Metholis section. Table 1 is a 
* summary of the results of some of the studies made in voca- 
bulary size. 2 
. Little research has been done in the area of vocabu- 
lary size recently. Ames (1964) constructed a test, using 
the dictionary-sampling method, to find the understanding 
vocabulary size of first-grade students. His data include 
‘a mean vocabulary size of 12,456 words, with a standard de- 


viation of 2,952 words. ERIC, Current Journals in Educa- 


tion, Education Digest, Psychological Abstracts, and 


Dissertation Abstracts were searched for studies made in 
the last five years in the areas of basic vocabulary, voca- 
bulary studies, and vocabulary development. Dupuy's Basic 
Word Vocabulary Test is the only vocabulary size and 


development study undertaken within the last five years. 
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TABL 


SOME ESTIMATES OF ENGLISH 
VOCABULARY SIZE BY 


| : METHOD USED ™ 


; VOCABULARY AGE OR 
AUTHOR (DATE) SIZE EDUCATIONAL 
) ri a LEVEL 
Dictionary Sampling 
nn 
— Ames (1964) - 12,456 first graders 
» M.K.Smith (1941) 24,000 first graders 
Shibles (1959) 26,000 . First graders 
Dupuy (1974) 1532 third graders 
Terman (1916) ‘3,600 third graders 
‘M. Ke. Smith (1941) 44,000 third graders 
Dupuy (1974) 4,419 seventh graders 
Terman (1916) 7,200 . seventh graders 
Holley (1919) : 8,478 seventh graders 
Kirkpatrick (1907) 10,666 seventh graders 
Seashore (1933) 65,000 college sophomores’ 
Gerlach (1917) 85,300 college sophomores /’ 
Frequency Sampling | 
M.E. Smith (1926) 2,562 first graders 
Dolch (1936) 2,703 first graders 
\ a 


Use: Spoken 
aaa a a aaa aa a SIRE aE Ca 


Horn (1928) 3,145 one five year old 


p 


»d 


Use: Written 


Rinsland (1945) : . 5,000 first graders 


_ 
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Methods 


—V 


FOr ; ' SEES tye 
Different methods of estimating vocabulary size in-_ 


clude methods based on use, frequency-sampling, and 


dictionary- samp] ing (Lorge & Chall, 1963). Methods based 


»] . « 
on use means estimates are made on the number of words used 


a 
in writing or speech. For example, Rinsland (1945) counted 


the words children wrote in compositions, /(Storjes, and 
~ = i ; 

letters. -He found that first graders knew ‘over 5,000 
af 


words. For _ Spoken vocahplary measurement, M. Horn, in 
1928, reported using a Mictaphone and parent. checklist to 
find that a five year-old girl knew over 3,145 words (Dale, 
1956, p.115). Buckingham and Dolch (1936) used the free 


¢ 
response method to determine what words children from grade 


8 grade knew. They directed children to write down all 
the words that they could think of in 15 minutes. While 


the objéct of they study was to create a word list, Bucking- 
ham and Dolch also’ considered vocabualry size. At the end 

r em : , ; 
of first grade, children knew 2,800 words. With these 


methods, vocabulary size is limited to the words chifldren 


use in the particular situations. Seashore (1940, p.19) 


t . 


states: 
; 
In certain studies of children,it is possible to 
record all of the'words used if a variety of si- 
buat isne over a period of days until no new words 
appeaér except those Yearned during the period. 
This gives a measuré Of the number of words which 
the individual did e, but it does not take into 
account those which he could,have used if neces- 


a sary, nor those which he cou only recognize. 


Vocabulary size can also be estimated by frequency 
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sampling (Lorge & Chall, 1963). In this method, 
Thorndike's books of frequency word counts are used. Words, 
are taken from each specified frequency level, and the sub- 
ject’ is tested on them. The vocabulary is estimated by av 
ding the number’ known at each Ilevel. For . example, 
“yonnaihe te A Teacher's Word Book of 20,000 Words might be 
used to test a student. Words could 7“ selected from the 
commonest thousand, second thousand, and third thousand 
frequency Vevets for testing. The student May get correct 
halt: he words from the commonest thousand, a guarter of 
the words from the second thousand, and none from the last. 
The subjéce te. youabayaey size oe be estimated at 750 
Lis 500 (1/2 X 1,000) + 250 * x 1,000) + 0 (0 A a2 
1,000) = 750. In the frequency-Sampling method, vocabulary 
size is limited by the number of words in the’ lists-- 
10,000, 20,000, or 30,000. M.E. Smith (1926 found that 
first graders knew 2,562 words using the 10,000 word list. 
. The dictionary-sampl ing method involves selecting a 
dictionary, taking a sample of words from it, and testing a 
subject on the sample. To vat the subject's vocabulary 
size, the numbeg correct on the sample is multiplied by the 
—Yatie of the dictionary size to its sample size. For|\exam- 
ple, a sample of 100 words may be taken from a dictionary 
having 50,000 words. A subject gets 90 words correct on 
the sahiple. It is! estimated -that ‘he would know 45,000 
words since 90 x 50,000/100 = 45,00Q. This method limits \ 


oe 


—T 
the number of words a person can know to the size of the 
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dictignary from which the sample was drawn. | Ames (1964), 
used this method in finding the mean ‘first sends yeoabalary 
size to be 12,456 words. 

Ideally, an investigator could count all the words in 
the English language and bent an individual to determine 


new many words he knows. The dictionary sampling method 


comes closest to this, but there are inherent problems. 


- 


Problems Laan 

Many problems exist in vocabulary size cueaieswete: hee 
include 1)defining a word, 2)defifttiing "know" and testing 
it, 3)failure to designate which vocabulary is eins 
ed, and 4) sampling errors. Each problem is discussed in 
turn. 

Definition of "word". Since investigation of vocabu- 
lary size began, there has been no unanimous agreement on 
what exactly constitutes the basic \nit of measurement--the 
word. Obviously, different wares’ of measurement would 
yield variations in estimated jesabutaty size. This is [ 
peoblem in all methods used. Generally, most investigators 
accept a lexical entry in a dictionary as one word, and in- 
clude wéth: that one word the variant forms such as plur- 
als, comparatives and superlatives, and participles. When 
the variant forms are counted separately, the Vocabulary 
estimate would be larger. 


This disparity in method is illustrated by three stu- 


4 Z 
dies. First, Ames (1964) randomly selected entries from a 
7) 


2! ‘ 
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dictionary which were printed in bo\dface type. He did not 
ificlude derived words or words formed from the basic words, 
an 
which were in medium type ahd indented. Secondly, Seashore. 
and ‘Eckerson (1940) considered both basic and’derived words 
in.a dictionary. Basic words were typed in boldface print 
Z : 
next to the margin, derived words were typed in medium’ 
print and indented. The lists were combined to make a third 
“eoral eee? list. Thirdly, Seashore’ and Eckerson (1940, 
p.16)) cited Thorndike's rules for classifying. words as 
variations ‘of a word. A variation, ng@t to be counted 
separately, could .be one of ‘these: plurals "s" and "ies", 
adverbs ending in "ly", comparatives and superlatives, verb 
forms ending in “s", "d", "ed", "ing", past participles 
ending in "n", and adjectives adding "n" to proper nouns. 
Just these three examples of defining a word yield a 
different number of total words to be sampled. Seashore 
and Eckerson (1980, p. 16) stated their method of counting 
was different than Thorndike's: 
By counting all of the words in the dictionary 
which would be grouped with) our basic sample ac- 
cording to Thorndike’ s definition we find that, 
in proportion, our estimate of three hundred 
seventy thousand two hundred sixty as the total 7 
number of words in the dictionary would be 
A equivalent to. a much smaller number of words ac- 
cording to Thorndike's definition. 
‘Seashore and Eckerson' s list would also be smaller (limited 
to their basic word list) if counted by Ames' method. 


Dale (1965, p.898) followed Thorndike's rules in dis- 


A 
cussing counting inflectional forms and the age of the sub- 


f 
« B 
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ject being tésted. He counted a word as new "when the in- 


flection is likely to make it -markedly more difficult. 


° 


Thus you may know like but not likely. This problem is 


greater for the younger child who has not generalized the 


regular endings."- But just ,how one defines “markedly more 


difficult" is not described, and seems imprecise. 


Definition. of "know". What does it mean to "know" a 
‘ whe A 


word? Not only the depth of meaning but the degree to 


which one)realizes that a group of sounds or symbols con- 
stitutes a word can be discussed. Dale (1965, p.898) 
; apts: 2 ieceae ; 
described the range of knowing a word. On a continuum, one 
may 1) never have seen: the word, 2) know there is such EJ 
word but not know its meaning, 3) haveg"a vague contextual 
placing of the word.", 4) recognize it again and remember 
it, or 5) require very fine distinctions, etg. between "sym- 


pathy. and empathy." Dale believed the nature of the test-- 


that is, where on the continuum a word is to be defined-- 
we “ 


influenced the size of vocabulary, and thus, wasS a source 
of estimated*size differences. ; 

Dale (1965, p.897) was also concerned by the lack of 
asSsesSing depth of meaning. He stated, "One ‘of the major 
deficiencies of mer lists which are Sor paReeg concerned 
with meanings, not spelling, is that they have not usually 


studied frequency of meanings." Then he pointed out, howev- 


eee t— 


glish Words, published in 1938 by Thorndike and Lorge. 
% ’ 


This list gives the frequency with which different meanings 


b ® 
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‘of words in print occur. lLorge and Chall (1963) were simi- 


larly concerned and ask whether one has to know all mean- 
ings of a word or only the commonest meaning to "know" the 
word. 

Lorge and Chall (1963) also raised questions akout 
testing knowledge of a word. To indicate that a word is 
known, one may define it, us& it in a sentence, check it on 
a List; OL pick it from .four or five choices. Multiple 
choice tests offer the convenience of short saniietacracied 
time, ease of correction, and relative inexpense. But 


there are some difficulties in this method, mainly in the 


‘possible combinations of knowing the key and the choices. - 


A subject may know the key, but not in terms of the 
choices, or may not know what the choices mean. The sub- 
ject also may not know the key, but know all the Ghsises: 
One is given credit for knowing only the key word. Lorge 
and, Chall questioned whether one should not also be given 
credit for knowing the correct alternative. 

Another problem in testing word knowledge is the need 
to decode words on a reading test. A subject can fail a 
word because he is not able to decode the key or the alter- 
natives thoygh the meanings are known. Oral administration 


alleviates this problem. Lorge and Chall (1963) suggested 


: 5. ee 
giving a test Orally to those people whe can not recognize 


_a word in print, if the fuged Meaning is being measured. 


Since ‘many children in primary grades have difficulty 


decoding words, oral administration would allow measurement 
” j e 5 
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\ 
of word understanding. However, the task becomes-measure- 
ment of understanding vocabulary, not reading ‘Yocabulary. 
Separation of ,the reading process from word under- 
standing was further expressed by Jackson (1973, p. 9. He 
was concerned, with what vocabulary development is about. 


; = 
He stated,"...when we are teaching vocabulary we are at- 


tempting to foster in the youngster an increasing knowledge 


of his world and to develop the representation of this 
,knowledge through verbal symbols or words." Much of this 
development accurs orally and ‘Jackson feels: that once a 
meaning is ceepaaenced by an oral symbol, recognizing the 
visual symbol is irrelevant to vocabulary development. The 
child may have a problem in word Hamangs but not in vocabu- 
lary development. 


Designation of vocabulary. Another area that. causes 
confusion is the failure to state which vocabulary is being 
measured. We neve listening (aural understanding), speak- 
ing, writing, and reading .vocabularies. These are not the 
same size, and their relative size toward each .other 
changes with age. At ffirst one's reading and writing voca- 
bularies are very smal} compared to his speaking and aural 
understanding vocabualry. With age, one's reading and 
writing vocabularies overtake speaking and aural under- 
standing vocabularies. Lorge and Chall (1963, p.148) 


stressed the need for stating which vocabulary is being 


measured for young children: 
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Particularly for first-grade ehildeen whose read- 

ing and writing vocabularies are first being 

developed, it is essential that estimates be un- 

derstood in terms of the method employed--whether 

based on words spoken, understood when heard, or 

understood when read in print. 
When Rinsland °(1945) found cir eLeseeders knew about 5,000 
words, it must be remembered this was writing vocabulary 
size. Ames (1964) found first graders knew 12,456 words, 
which was the mean understanding vocabulary size. Lorge 
and Chall "(1963, p.148) cite Neher's study: “Estimates . 
based on speeci»pr writing of adults are usually lower than 
those obtained from sampling methods, since a person*a~~ 
recognitional vocabulary is considerably greater than his 
expressional vocabulary...” | 

Errors in sampling. One cause of variations in voca- 
bulary size estimates is -errors in sampling frequency 
counts. In general, these errors tend to limit the es- 
‘timated size of one's vocabulary. Dale (1965, p.897) made 
the-following criticism of frequency-samp] ing: “Investiga- 
tors miss simple words which may not appear in print but 
are ce ae Common words are often omitted. Rinsland, 
for example, has “platypus” but not "pimple", "mucus" but 
riot: “snot™.4.%. © é ¥ 

Errors in sampling dictionaries can either limit oy 
inflate vocabulary size. Problems in using this method in- 
clude dictionary size, editors' selection of words to in- 


clude in a dictionary, and the space allocated to the words 


(Lorge and Chall, 1963). There is also a problem of recen- 
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cy. The dictionary which is being sampled limits the size 
of’ one's vocabulary to the size of that dictionary. A 
small dictionary may not have some of the words a person 
knows whereas a large unabridged dictionary has a greater 
chance of having included these words. 

Colvin (1951) cites several studies using abridged and 
unabridged dictionaries. The early studies utilized 
abridged dictionaries. A sample was taken with each word 
representing 10 or 20 words. Terman found twelve-year olds 
knew 7,200 words. The same list of words was used, but a 
different type of test given, in whieh Holley found 
twelve-year olds knew 8,478 words. When Kirkpatrick tested 
seventh-graders on a list of 100 words from ‘Webster's 
Academic Dictionary, they knew 10,666 words. Later studies 
used unabridged dictionaries. Gerlach found college soph- 
mores knew 85,300 words, while Seashore, using a different 
dictionary, found college sophomores knew about 70,000 
words. M.K. Smith found seventh-graders knew 55,000 words. 
These figures support Hartmann's (1941) findings that —e 
bulary estimates depend on the size of dictionary sampled. 


i] 
Another problem Lorge and Chall (1963) point out is 


the editors’ select\ion of words to include in a G@ictionary. 
An abridged dictionary has a biased sample of the more com- 
mon words from the total English vocabulary. In unabridged 
dictionaries, editors may have cut old or uncommon words. 
Thus, even a sample from a large unabridged dictionary is 


fa te 
not 4 sample of the un ler se of the English language. 
¥ 
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While old words may have been cut, what about new 
words being added? Little mention was made in the litera- 
ture about the recency of the dictionary being ‘sampled. 
Studies that depend on the Seashore-Eckerson test (or any 
hese dependent on a sample from an earlier edition diction- 
ary) ignore the increase in vocabulary from World War II 


through present space exploration. 

Lorge and Chall (1963, p.152) cite Thorndike on an er- 
‘ror in dictionary-sampling which has to do with the space- 
sampling of words. "...selecting words falling in a par- 
ticular place in the dictionary, for example the first word 
on a page would overweight the sample with the most impor- 
tant words, since most dictionaries give more space to im- 
portant words.” 

Another cause of variation in vocabulary size, when 
using the dictionary-sampling method, is the representa- 
tiveness of the sample. The Seashore-Eckerson Test was 
critiqued by Lorge and Chall (1963, p.150-153). The major 
criticism was that Seashore and Eckerson had a biased sam- 
ple of easier words, since they did not adjust for multiple 
entries of words. Also, their basic words included names 
of people, geographical names, word parts, proper nouns, 
and duplicate spellings, along with the several entries of 
the same word form. Counting multiple entries was criti-~ 
cized because knowledge of the word was determined by 
recognition of only one common meaning of the word. ("Cow" 


may have been listed as eight words, but.if a student knew 


—3l 


‘ 


23 


it was ‘anjanimal, and no other definitions, he would still 
have been credited with knowing eight words.) Studies that 
‘ were based on the Seashore-Eckerson Test, e.g. Colvin 
(1951), Bryan (1953), and Shibles(1959), also had_ the 
results hétd in question. It was concluded that all these 
studies had inflated vocabulary size estimates. 
- 

Tests 

Most investigators in the literature constructed their 
own tests for meaSuring vocabulary size. Other than voca- 
bulary sections in battery tests, there are few tests 
designed specifically to measure vocabulary size of vaca 
children. Tests in Print II (Buros, 1974) has a list of 20 
tests that measure a range of types of vocabulary. These 
included tests for business and: professianals, tests ap- 
propriate for ages 18 and up, “or appropriate for grades 
nine and up. 
| Three tests appropriate for grades three and up were 
reviewed in Buros' Mental DEASHreMmenES Yearbooks. The 
‘first was The Survey Test of Vocabulary (Buros, 1949, test 
167), based on Thorndike's Teacher's Word Book of Twenty 
Thousand Words. It was more than 35 years old. No infor- 
mation was furnished concerning test construction, or when 
the norms were obtained. The second test was The Vocabu- 
lary Test: “National Achievement Tests (Buros, 1949, test 


168) which was more than 30 years old. It had no informa- 


tion on word selection other than "...the words used have 
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been carefully selected and graded." The third test was the 
Wide Range Vocabulary Test (Buros, 1949, test 169) which 
also was more than 30 years old. There were no data‘on re- 
liability, intercorrelatiorn between forms, or correlation 
with other well known measures. 

There are foreign language tests (English as a second 
language) available which measure English vocabulary. 
These tests were designed for older students and tended to 
check idiomatic expressions, however. 

The test developed by Dupuy (1974) is quite a contrast 
to the ones previously mentioned, in terms of word defini- 
tion, test construction, and standardization, The Basic 
Word Vocabulary Test is discussed in chapter Three under 


Instrument. 


Summary 


Controversy over estimated vocabulary size has contin- 
7 


/ ued since investigation began. Lorge and Chall (1963) 


~ 


pointed out methodological issues that contributed to vary- 


ing vocabulary size estimates. Although difficulties are 


present, the dictionary sampling method appears’. to 


represent the most words that a person could know. The 


‘ problems of defining a word, defining know, and sampling 


have not been completely settled or eliminated. Therefore, 


there is need to define each aspect of measurement in any 


study undertaken, so the reader can be aware and interpret 


results in light of these issues. ‘ 
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CHAPTER III 
PROCEDURE 


This study was an attempt to extend the Basic Word Vo- 
cabulary Test (BWVT) downward by administering it orally. 


Some investigations included measuring the aural under+ 


stdnding basic woth vocabulary size of primary grade chil- 
dren; finding correlations between Oral scores and IQ, 
between oral scores and reading scores; and determining if 
significant differences exist between the oral BWVT and the 
reading BWVT scores. ai | chapter is a description of the 
population of the study, instrument, the pilot’ study, 


research design, data collection, and data analysis. 


Population 
All the subjects in this study live in the township of 


Green Brook, New Jersey, and attend its one elementary 
school. Ninety-six subjects participated in the exper iment 
with an equal number of sijects (24) being randomly 
selected from grades one through four. It was felt this 
number of subjects would be manageable in terms‘of adminis- 
tering aad” Heorina the tests. Half the subjects were 
girls, half were boys. ih : r 


According to the 1970 U. S. census, the township in 
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which the subjects live has 3,838 innableants. Of the peo- 
ple who were 25 or older, 44% were high school graduates. 
The median income was $11,148, and mean income was $12,128. 
The labor force was comprised of 1,660 people 16 years and 
older; 64% were in sales, clerical, craftsmen, 
operatives,and transport positions.’ All subjects were 
white, and either Catholic or Protestant. 

All subjects' IQ's were measured when they were in the 
second grade. Each year in November, the Kuhlman-Anderson 
abet had been given by the second-grade teachers. heey 
there were no IQ data for children in first grade.) In his 
review of the Kuhlman-Anderson Test, Michael (Buros, 1965) 
said it “compares favorably with snvpether competitive in- 
Strument measuring general intelligence." 

The second-grade subjects' IQ's ranged from 99 to 140 
with a mean of 117, the third-grade subjects’ IQ's ince 
from 104 to 133 with a mean of 118, and the fourth-grade 
subjects' IQ's ranged from 98 to 140 with a mean of 1¥9. 
Fourth graders took a aad Abilities Test in Febru- 
ary 1976. Their scores ranged from 81 to 140, with a mean 
of 108. This test was not reviewed in Buros' Mental Meas~ 
urements Yearbooks. | 3 

Reading abilities were er ee by the Gates- 
MacGinitie Reading Test, Primary Forms A, B, and C. In his 
review of this test, Van Roekel (Buros,1970) found tie 


Standardization "to have been rather carefully done" but 


when "used alone, however, these tests probably function 
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We 
best as survey tests." Since the tests were given in May, 


‘75, reading abilities would be higher than indicated due — 
to passage of time and additional instruction. The second 
graders' vocabulary scores, ranged from 1.4 to 3.5, while 
the third graders' vocabulary scores ranged from 2.2 to 
5.2. The fourth graders' vocabulary scores ranged from 3.0 
to 6.8. No standardized test information on reading abili- 


ty was available for first graders. 


Instrument ' 

Dupuy's Basic Word Vocabulary Test (BWVT) measures: the 
absolute and the caiatine size of a literate person's read- 
ing vocabulary. Its range extends from third grade through 


Ph.D. The BWVT has high reliability and validity as a 


‘reading test. The overall internal consistency reliability 


is close to .96. Its standard error of measurement is 3.0 
raw score points (Dupuy, 1974, 5.30). 

For concurrent validity, the BWVT scores were corre- 
lated with scores on the School and College Ability Tests 
(SCAT), scores from five different tests of the Sequential 
Tests of Educational Progress (STEP), and with test scores 
from verbal sections of five standardized tests. These 
five standardized tests igcluded the Metropolitan Reading 
Readiness Test, the Lorge-Thorndike Intelligence Test, the 
California Test of Mental Maturity, the DLEr Sneaks peets 
tude Test, and the School and College Ability Tests. A 


median correlation of .76 was found. The correlations 
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ranged from .60 to .84 at Stages 4 to 12, but were .28,. 
-45, .46 at, first, second, and third grades, respectively. 
Fourth grade: correlation was .80. Dupuy (1974, p.30) 
states, "The low correlations for grades l, 2, 3 are con- 
sistent with the findings that the BWVT is too difficult at 
these levels when given as, a reading test." This is why 
this study is being done with oral administration. 

Basic word knowledge is highly related to age and edu- 
cational levels. Correlations of the BWVT and acubionsl 
level were .80 Foi grades 3-12 and .60 for grades 3-7. 
BWVT and age level correlations were .77 for 8-17 year olds 
and .55 for 8-12 year olds at better than the .01 level 
(Dupuy, 1974, p.3l). 

The content validity can be evaluated by reviewing the 
procedures Dupuy (1974, p.4-6) used to select the Basic 
Word Vocabulary. Criteria for drawing a sample of main en- 
try words from Webster's Third New International Dictionary 
were prepared. The dictionary has 450,000 entries. Main 
elegy words had to meet these criteria: 1) the word had to 
be a main entry, 2) all homographs were counted as one 
word, 3) abbreviations were counted, but not affixes, and 
4) letters were not counted. 

An estimate of the total number of main entry words 
was made following explicitly set procedures. Basically, 
every 10th page had one column sampled. Computations 
yielded a rounded population estimate of 240,000 words. A 


l-percent sample (2,400 words) was. taken by selecting the 
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third word *(same criteria applied as before) in one column 
On every page. The order of the columns was rotated. 
These 2,400 words were put into one of four eatenaptast 13 
compounds) or hyphenated, 2) proper names, 3) sibel 
tions, 4) others. Then, only the 1,360 “others” were 
checked against main entries in Random House, World Book, 
and Funk and Wagnall's. 381 words were main entries in all 
the four dictionaries. These words were reduced Sy eléns 
inating technical, foreign, slang, and archaic words, as 
‘Classified by Random House. 

Another set of criteria was developed to. classify the 
remaining 307 words as either basic or derived. A derived 
Or variant entry word was 1) another form or different 
tense of the same word, 2) a different spelling, 3) another 
word which gave a fuller definition, 4) a two or more word 
entry that used om@p of the words in the definition, and:5) 
a derived word whose meaning could be known if the base and 
‘affix meaning were known. A sample of 123 words resulted 
which have been carefully and frecisely ES en Oe 
ly, the test has excellent content validity. . 

The test (of the 123 basic words) is a multiple choice 
test that has five choices for each item--one correct 
response and four distractors. Similarly precise criteria 
for constructing the correct responses and distractors were 
developed. For example, correct responses had to be the 


mK, 
most common meaning (according to World Book Dictionary. 


They had to be easier (more frequently used) than the stem 
38 
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word, and usually were a one word synonym. Alliteration 
, ol 


~similarity to the stem was avoided. Correct responses were 
balanced in length 4 distractors, and in parallel form in 
tense or part of speech. Distractors had to be less diffi- 
cult acd the stem, in parallel form to the stem and other 
choice’s, and alliteration and spelling similarities were 
avoided. | 
When the test constructiom was completed, two pretests 
were given. About 50 of the pretest: subjects evaluated the | 
jeer, Then, standardization was done on 3,100 students in 
grades 1-12 in the Fairfax eounty Virginia public schools. 
The parents “represented a diverse background of social 
status, seeupations and geographic origin. Item analyses 
were made to find difficulty level, internal gponsistency, 
distractor effectiveness, and sex differences for each item 
(Guppy 719740111). By using the nationally standardized 
test scores of verbal achievement, grade ea cies RHI were 
developed and adjusted to reflect national norms. 


eos : Fa aR 
Upon administration of the test, a vocabulary size es- 


“ » 


‘timate can be easily calculated by .multiplying a subject's 
raw score by 100. There are 123 basic words in a one per- 
cent sample of the main @ntries in Webster's Third New 
International Dictionary. So, multiplying the number 
correct out of 123 by 100 yields an estimated basic word 
vocabulary size for an individual. In arriving at a figure 
for vocabulary size, it must be remembered that the unit of 


measurement is a basic word, and that the vocabulary being 
a - F - 
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measured is’an aural understanding vocabulary. 

In this study, the first section of the BWVT was used 
for Oral administration. Obviously, since the test's range 
is through Ph.D. level, primary grade children would not 


’ 


have to take the entire test. 


Pilot Study 
The pilot study for this experiment was done in Febru- 

ary, 1976, Five children per grade level (kindergarten, 
first, second, third, and fourth grades) were “earidonly 
_ Selected to participate in the study.. At each grade level, 
five names were drawn from a box. Each group of five was 
given the BWVT orally. The study aided in foreseeing diffi- 
culties that would arise in running the actual test. For 
example, kindergarteners could not handle marking an item 
with \ five alternative choices in. re group. situation. 
Perhaps a different format for the test or individual ad- 


Ministration would work at this grade level. 


Design et e oo 
It was felt that 24 students per grade level Yould be 


4 


a manageable number in terms of administration and scoring. 
Therefore, 96 students in first through fourth grades oer 
randomly selected for the Basic Word Vocabulary Test (BWVT) 
administration. Those students who participated in the pi- 


lot study wére not included in the sample who took the 


test; At each grade level, all students' names, except 


e 
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those who had participated in the. pilot study, were put in 
a box. A stratified random samplé--was selected by’ taking 
the first 12 girls' names and the first 12 boys’ names at 
each grade level. Investigation of sex differences could 
——-te easily andere; if desired at.a later date. | 

Students in third and fourth grade.were given the BWVT 
‘first as a reading test and one week \later, as an oral 
test. The dual administration provided data to compare 
reading and erat yooabulet ina: Students in first and 
second grades had oral administration only. 


Data on age, IQ, and RPaeany scores of subjects were 


collected from school files. \ 


Data Collection 
Subjects were tested during February and Magen. of 
1976. The BWVT was administered in the afternoon, with 
each group using approximately 35 minutes to take the test. 
Testing was done in groups of 4, 12, or 24. The first- 
graders and second-graders were tested in groups of 4 at a 
- time. The small number facilitated administration; the ex- 
aminer could make sure each student was on the current 
question. The third-graders were tested 12 at a time; the 
fourth graders were tested 24 at a time. 
The test was administered only by the experimenter for 
all grades. Children were moved to the experimenter's 
classroom for testing. The subjects were told the examiner 


was working on a project (like they did in school) to find 


Al 
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out how many words,children of different ages knew. They 
were assured that this test would not affect their report 
card grades and that the results would only be seen: by the 
examiner. Test booklets were. distributed and directions 
read to the subjects--they were instructed to mark an X 
next to the correct choice on each item. The examiner read 
each item (stem and five choices) aloud. 

The BWVT was not given in its entirety. | Since the 
items were ranked weaning to difficulty, children through 
fourth grade would not know words at the upper end. Enough 
items had to be administered to ensure the highest-scoring 
subject would get 10 errors. The highest scores from the 
pilot study helped estimate a cut-off point. If a subject 
had not made 10 errors, he was tested furt ‘en 

‘ Scoring was done by Dupuy's: 10-E an In this 
method, a subject's raw score is determined by counting the 


number correct, and counting is stopped at the 10th error. 


Omitted items are not included in the count. 


Data Analysis i 


For each grade level, the means and standard ,devia- 


tions from the oral administration of the BWVT were calcu- 


‘lated. Then the means were multiplied by 100 to get the 


estimated understanding vocabulary size of basic words. 
To ‘verify significant differences between first and 


var 
second grades, second and third grades, and third and 


fourth grades, an analysis of variance and Scheffe's tests 
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of contrasts were run. . 

The means and standard deviations of the reading ad- 
ministration of the BWVT.were calculated tor the third 
grade and ‘for the fourth grade. To get an estimate dé the 
third grade and the fourth grade reading vocabulary sizes, 
the means were multiplied by 100. 

To determine “if significant differences existed 
between bead and reading scores at the third grade and at 
the fourth grade levels, two t tests were made. 

A correlation matrix was run to find correlations 
between the oral BWVT and 1) the Gates-MacGinitie Primary 
Reading (vocabulary) Test, 2) the Gates-MacGinitie Primary 
Reading (comprehension), Test, and 3) the Kuhlman-Anderson 
IQ Test. The statistics shown used Pearson product-mament 


@ 
correlations. 


CHAPTER IV 


FINDINGS AND DISCUSSION 
A 


This chapter presents an analysis of the data obtained 
from raw scores on the Basic Word Vocabulary Test (BWVT), 
given orally to 96 subjects in first through fourth grades. 
Data also analyzed are the scores from the BWVT given as a 


reading test to the third and fourth graders. 


Presentation of Data 


The data from the oral administration of the BWVT, the 
reading administration of the BWVT, and data collected from 
students’ school files are presented im Table 2. The mean 
raw scores on the oral BWVT were 17.8 (S.De=5. yy 23.3 
(S.D.=4.6), 37.8 (S.D.=11.2), and 40.0 (S.D.=13.2)° for the 
first, second, third and fourth grades respectively. When 
the means were multiplied by 100 to produce the aural 
understanding vocabulary size estimate, it was found that 
first graders knew 1,780 basic words, second graders knew 
2,330 basic words, third areas knew 3,780 basic words, 
and fourth graders knew 4,000 basic words. 

The mean reading heutes on the BWVT were 27.6, and 
31.3 (S.D.=12.6) for the third and fourth grades respec- 


tively. 
4d 
35 


TABLE 2 


a — 


MEANS FOR ORAL BWVT, READING BWVT, VOCABULARY SIZES, : 
AGES, IQ, AND GATES-MACGINITIE READING TEST 
BY GRADE LEVEL 
(N = 96) 


BWVT ORAL READING BWVT GATES 
CA MA IQ GRADE LEVEL® 
VOC. COMP. 


lox 


SIZE SCORE 


fy 17.8 5.4 1780 82.0 ‘ 
> 9 339° de 2930 94.6 110.5 117.2 2.7 2.7 
3 37.8 11.2 3780 27.6 11.4 2760 105.7 124.8 118.2 4.3 4.1 
4 40.0 13.2 4000 31.3 12.6 3130 117.8° 138.5° 119.1© 5.0 4.7 


aN -= 24 for each grade. 
Pnata are one year old for third grade and two years old for fourth grade. 
“Data pre old--add .6. | 


d 
contrasts. 


> 


€mean is based on only 20 scores, since 4 of 24 subjects had no IQ data. 


“ 


Data are significantly qi tserent by analysis of variance and Scheffe“s tests on 
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The raw scores were multiplied by 100 to find the reading 
vocabulary size estimates. Third graders knew 2,760 basic 
words, while fourth graders knew 3,130 basic words. ; 

Data from school files include students! ages, IQ, 
and Gates-MacGinitie Reading Test results. In this school 
system, Kuhlman-Anderson IQ tests were administered only 
during second grade. Therefore, no data for first grade 
were available, and the data for third and fourth graders 
were old. The mean IQ's for second, third, and fourth 
grades were 117.2, 118/2, and 119.1 respectively. The mean 
IQ of the fourth grade is based on 20 scores only, since 
four subjects had no IQ data. Chronological age (CA) is 
calculated for age in months at testing time (February). 
(The grade level is actually an extra -5 since testing ocf 
cured in February.) Mental age (MA), also expressed in 
months, was determined by using the equation 
MA = CA X IQ/100. ' Reading test scores were from the previ- 
ous May, so students' actual reading levels would be higher 
(add .6 For six months) due to passage of time and addi- 
tional instruction. Mean Gates' vocabulary avails levels 
for second, third, and fourth graders were 2.7, 4.3, and 
5.0 respectively. Mean Gates' comprehension. grade levels 
for second third, and fourth graders were 2.7, 4.1, and 4.7 


respectively. 


Significant Differences 


It is expected that each grade level mean score on the 
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BWVT should be higher than the mean score of the preceding 
grade. To verify that the differences were significant, a 
- one way analysis of variance was applied to the mean raw 
score results on the oral BWVT. The analysis revealed a 
main effect of grade (F = 32.06, df = 3/92, p < .01), shown 
in Table 3. Further analysis of the main effect means, us- 
ing Scheffe's tests ari enbemcia, indicated that all pair- 
wise comparisons were significant (p < .01), shown in Table 
4. | | 

To determine if differences between the .oral .BWVT 
scores and the reading BWVT scores were significant, t 
tests were performed between the third’ grade oral and read- 
ing means and between the fourth grade oral and reading 
means. desuite are shown in Table 5. The t test performed 
on the third grade oral raw score awe and the reading raw 
score means was 6.9, significant at the .001 level. The t 
test performed on the fourth grade oral raw score means and 
th naw din raw score means was 6.3, aldo Significant at 
the .001 level. The problem of results being confounded by 


the practice effect will be taken up in the following Dis- 


cussion section. 


Correlations 
To find the correlations between the oral BWVT scores 
and IQ, and between the oral BWVT and the Gates-MacGinitie 


Reading Test sections, a correlation matrix was calculated. 
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TABLE 3 


ANALYSIS OF VARIANCE AMONG GRADE LEVELS 


(N = 96) 
SOURCE OF SUM OF SQUARED DF MEAN F 
VARIATION DEVIATIONS ° SQUARE RATIO 
of 
-AMONG SAMPLES 8457.06 3 2819.02 32.06°* 
WITHIN SAMPLES . 8088.81 92 87.92 
TOTAL 16545.88 95 ‘ 


* p (3/92) = 32.06; p > 0.01 


TABLE 4 


ANALYSIS OF MEAN DIFFERENCES 
‘AMONG GRADE LEVELS USING 
SCHEFFE'S TESTS ON CONTRASTS 


(N = 72) 
GRADES 1 2 at 4 
i — re Ya « 19,958" * 99,167" 
2 14.417"" 16.625" " 
3 2.208" 


\ 


4 ' - 


ak 
p < 0.01 
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TABLE 5 


t TEST BETWEEN ORAL ADMINISTRATION AND 
READING ADMINISTRATION OF THE BWVT 
FOR THIRD AND FOURTH GRADES 

“oy (N= 4B) 


MEA} MEAN 
GRADE? ORAL \ READING a SIGNIFICANCE 
RAW SCORE RAW SCORE VALUE : LEVEL 
3 37.8 27.6 6.9 0.001 


4 . 40.0 ‘ 31.3 6.3 0.001 


nN = 24 for each grade. 
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There were 68 subjects who had complete information for 
Oral BWVT, Gates reading vocabulary, Gates comprehension, 
and the Kuhlman-Anderson I.Q. Four fourth graders did not a, 
have IQ data. The results are shown in Table Be The oral . 

BWVT has a 5 r with the Gates (reading) vocabulary, a .59 7 

r with the Gates (reading) comprehension, and a .29 r with 1 spe? 


the Kuhlman-Anderson IQ test. 


TABLE 6 ‘s i. 
CORRELATION MATRIX OF ORAL BWVT RAW SCORES, 
GATES VOCABULARY GRADE SCORES, GATES 


COMPREHENSION GRADE SCORES, AND 
KUHLMAN-ANDERSON IQ SCORES 


= 68)8 
BWVT GATE GATES K-A 
=e ’ voc. comp.» 19° 
ey ee ae eT ae ee 
BWVT - aa +.59"" +.29" 
GATES Voc. ) re 7. +.40°" 
GATES COMP, . aa? 


K-A IQ , Z 
re 


®Data complete for second, third, and fourth grades only. 
Pnata were six months old for all grades. ts 


“Data were one year old for third graders and two years 
old for fourth gradérs. 


*p < 0.05 


**5 << 0.01 d 


Discussion 
The first question to be examined was the size of 


children's understanding vocabularies in “a one through 
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four. he results\(1, 780 for first grade, 2,330 for second 
grade, 3,780 for thied grade, and 4, 000 for fourth aia 
were much lower than\ the results of previous studies. ’ This 
is rather startling ‘tn light of the IQ and reading level of 
this experiment's population. One could expect students 
from a more normally distributed I0 population to score 
eyen lower. The low number of words is not due to students. 
‘doing poorly on the test. The instrument measures a max- 
imum basic word vocabulary of 12,300 words, which is Ph.D. 
level size of vocabulary. In the Ames (1964) study, 12,456 
words was the estimated vocabulary size of first graders. 
The issues raised by Lorge and Chall (1963) indicate rea- 
sons for aia ail vocabulary vies estimate. 

The first methodological issue, raised by Lorge dnd - 
Chall (1963), in the djgfionary sampling method, was dic- 
tionary size. The low estimates were notg@ve to using a 


small, abridged dictionary. The sample of main entry words 


was drawn from Webster's Third New International Dictionary 
which contains more than 450,000 entries. ro 

The second issue discussed by Lorge and Chall(1963) 
was the definition of a word. The criteria for the unit of 
measurement--the basic word--seem to be responsible for the 
low vocabulary size result. Dupuy: (1974, pe 5,6,7) es- 
timated the word population of Webster's Third New Interna- 
tional Dictionary to be 240,00 main entries. He reduced 


the one-percent samplezof 2,400 words to just 123 basic 


words by eliminating words classified as extra homographs, 
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compounds, abbreviation’, hyphenated, proper nouns, techni- 
cal, foreign, slang, archaic, derived, variant, or redun- 
dant. The basic words ‘also “had to be present ingthree oth- 
er large specified dictionaries. 

In comparing the procedures Ames (1964) used, there 
are fewer and less strict criteria for defining which words 
should be included for testing. jrmes’ sampled the Funk and 


Wagnalls New Standard Dictionary (1956). Using a table of 


random numbers, he chose 500 words that were main -entries.~ 


Four affixes and three abbreviations were eliminated, leav- 
ing 493 words. No danktod was made about compounds, hy- 
phenated forms, or proper names. No check was made to see 
if the 493 words were main entries in other diattonscion! 
Since the test was to be administered to first graders, 
Ames had five graduate students eliminate difficult words. 
There remained 181 words, which Ames tested on ten sixth 
graders. Ames f1964, p.66) “asked for responses until he 
‘was sure that at least one member of the group knew an ap- 
propriate meaning for the word." Neither set of students 
seemed to have specific criteria for eliminating words. 
‘Probably some foreign, slang, technical, and archaic words 
were eliminated, but ARES does not specifically state this. 
Nothing was said about derived, variant, Or redundant 
forms. Dupuy started wLEH a one percent sample of 2,400 
words and reduced it to 123. Ames started with a 500 word 


sample and reduced it to 121. Dupuy's 123 words range from 


first’ grade through Ph.D. level, while Ames' 121] words were 
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all meant for first graders. 
: 
The second question to be examined was the existence 


; f significant differences “afons the grade levels. The 
Cae of the analysis of variance shown in Table 3 and 
Scheffe's tests on contrasts shown in Table 4 verify that 
there were significant differences. It is interesting to 
note Bhat there is not such a great difference between 
grades bne and two, or between grades three and four, but 


there is a large difference between grades two and three. 


The difference could be due to a growth spurt with a subse-- 


quent plateau, or it could be that particular items were 
difficult for the fourth graders. A broader range study 
and/or an item analysis would give additional information 
‘on this question. 
ee 
The third question in the study dealt with the ex- 
istence ‘of significant differences pptecen oral and reading 
scores for the third and for the fourth grades. While the 


results of the t Tests ‘provide evidence for reliable 


differences, the results may have been confounded by the :% 


practice effect. Both.grades were administered the reading 
version first, then, one week later, the oral version of 
the .BWVT. For each grade level, the Goat scores could be 
higher due to practice alone, rather than an actual larger 
sized. oral understanding vocabulary. .This effect could 
have been avoided by administefing the BWVT as a reading 
test first to half of a grade, and as an oral test first to 


the other half. 


o4 


Biggie. 
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The last question of the study was concerned with the 
existence of correlations between the oral BWVT, Gates 
Reading Test results, and 10. The results of the oral ad- 
ministration of the BWVT correlate moderately (r = .74) 
with the Gates reading vocabulary for grades two, three, 
and four, but there is a lower correlation (r = .59) 
between the BWVT and the Gates comprehension section. One 
might expect the correlation between BWVT and comprehension 
to be lower than the correlation between vocabulary and 
BWVT. ‘There is a discrepancy in time in these 
calculations--the Gates* test had been given the preceding 
May, so .6 could be added to the .redding levels before per- 
Pouning-.theeerrelations. Perhaps using a different type 
of standardized reading vocabulary test (or vocabulary sec- 
tidn ) would also yield higher correlations. The Stanford 
Diagnostic Reading Test (upper sementary level) measures 
vocabulary (in stanines) by having children choose one of 
three orally pronounced choices that best completes an 
Orally pronounced sentence. This procedure is a closer ap- 
proximation of the oral administration of the BWVT. 

There is a low correlation with 10 (r #29, p < 05), 
This is an unusual result because, generally, vocabulary 
tests and IQ do correlate highly. Again, the time of IQ 
test administration must be taken into account. While the 
IQ data are three months old for second graders, the data 
are one year and two years old for third and fourth graders 


respectively. The correlation may be higher with more 
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‘cutrent IQ data. 

Since the BWVT was administered as a reading test to 
the third and fourth graders, the results-can be directly 
compared to Dupuy's study. The raw score means for Dupuy's 
_ third and.fourth graders were 15.3, and 27.0, respectively, 
while the reading raw score means from this study were 27.6 
(third grade) and 31.3 (fourth grade). Each grade in the 
two studies was tested midway through the academic year, so 
differences between studies were not due to extra instruc- 
tional time. The chronological-age grade placement of the 
students in this study was 3.6 for third graders and 4.6 
for fourth graders, indicating they were within the normal 
‘range. However, the mental-age grade placement was 5.2 for 
third graders, and 6.2 for fourth graders, indicating this 
sample had above average ability. 
otis study showed the feasibility of administering the 
BWVT orally. : Even though the sample size was small (N= 
96), and the data from student files was two years old .for 
some subjects on the correlations, significant information 


about vocabulary size was found. 
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CHAPTER V 


SUMMARY AND CONCLUSIONS 


“ ‘ 
This chapter summarizes the study, draws conclusions 


from the research results, and offers suggestions for addi- 


tional research on vocabulary size. 


Summary 
This study was an attempt to extend the Basic Word Vo- 


cabulary Test (BWVT) downward through oral administration. 
The first question to Be investigated in the study was, 
“How large are the aural understanding baste word vocabu- 
laries of children in grades one through four?" In order to 


answer this question, 96 subjects from grades one through 


* 
four were randomly selected to participate in the study. - 


The subjects wenty,to “the one township school in a Central 
New Jersey suburban community. The BWVT was administered 
as a reading test first to then and fourth graders, and 
one week later, the BWVT was gives to them orally. The 
first and second graders had oral administration only, and 
they were tested in groups “of four at a time. The raw 


scofes were converted into means by grade level and multi- 


plied by 100 to estimate, aural vocabulary sizes. First 


graders knew 1,780 basic words, second graders knew 2,330 ° 
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basic words, third graders knew 3,780 basic words, . and 


fourth graders knew 4,000 basic words. 


ae The second question to be investigated was, "Do signi- 


ficant differences exist among grade levels?" A one way 
analysis of variance showed main effect for grade (F = 
32.06, df = 3/92, p <.01). A further analysis using 
Scheffe's tests on contrasts showed all pairwise compari- 
sons were significant (p < .01). 

The third question to be studied was, "Do significant 
differences exist between the oral BWVT scores and the 
reading BWVT scores of third and fourth grade children?" A 
t test was performed between the third grade oral BWVT mean 
(37.8) and the reading, mean (27.6). The resulting t value 
of 6.9 was significant at the .001 level. A t test was 
also per formed between the fourth grade oral BWVT mean 
(40.0) and the the reading mean (31.3). The resulting t 
value of 6.3 was also significant at the .001 level. 
fesuite may have been confounded due to the practice ef- 
fect. | 

The last question to be investigated was, "What corre- 
PER LOnS exist between the oral scores and IQ, and between 
oral scores and the Gates-MacGinitie Reading Test?" A 
moderate correlation of ..74 between the oral BWVT and the 
Gates vocabulary test was found, while a lower correlation 


of .59 between the BWVT and the Gates comprehension test 


‘was found. The Kuhlman-Anderson IQ Test had a very low 


correlation of .29. There is a discrepancy of time in the 
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data taken from students’ files. The reading test data is 
six months old, and the IQ data is one year old for third 
graders, and two years old for fourth graders. The corre- 
lations may actually be higher than the results indicate. 
Conclusions 

Many problems exist in vocabulary size research. The 
BWVTA is an instrument that overcomes one of the biggest 
difficulties--the’ definition of the unit of measurement-- 
the basic word. However, because basic words have to meet 
several criteria, there are less of them in proportion to 
the total word population estimate. This makes for a lower 
vocabulary size estimate than the results of previous stu- 
dies. 

In this study, the major conclusion is that the BWVT 
can be extended downward through oral administration. How- 
ever, in its present format, it is best administered to 
groups of four or five at the first and second grade lev- 
els. Vocabulary size at each of the grade levels was 
lower than previous estimates, such as Ames (1964). To 
echoonive, Make low vocabulary size is probably due to the 
strict\criteria in defining the unit of measurement--the 
basic word. 

The results provide evidence for the importance of vo- 
cabulary teaching. The children came to school knowing a 
certain,amount of words, and increased their vocabularies 


fe 
significantly from year to year while in school. There was 
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no special emphasis on vocabulary,study, so a curriculum 
that would emphasize vocabulary study might produce stu- 
dents who would score within the nent higher ‘wtandare devi- 
ation. 

From Dupuy's study, it can be seen that the total 
Aunber. 6£ basic words to learn is ralatively small, com- 
pared to the total number of dictionary entries. If this 
is true, students would know more "words" if they knew some 
basic words, and a few rules for adding affixes, making 
compounds, and variant forms. Additional study in this 


area could lead to some curriculum change. 


Suggestions for Further Research 

. The first suggestion for further study is to replicate 
this experiment, using a larger number of subjects. The 
subjects should be of a more normally distributed? 10 popu- 
Site One should expect the results to.be lower vocabu- 
lary size estimates, in that case. 

Another suggestion is that, since the BWVT yields an 
absolute Vocabulary al ite could be used to measure wide 
range vocabulary growth. A longitudinal study on individu- 
als over a two-year, three-year or longer period would give 
results indicating how much growth they made. If the study 
were done over different grade levels, trends in rate of 
vocabulary growth could be monitored. 


ee 
The reading Nocabulary size is larger than the under- 


standing vocab#lary size in adults. At what point the 
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reading vocabulary becomes larger than the oral understand- 
ing vocabulary could be studied. The BWVT could be admin- 
istered orally and as a reading test to several grade lev- 
els above fourth to find the crossover point. Care must be 
taken to avoid the practice effect: each grade could be di- 
vided in half, with one half taking the oral version first 


and the other group taking the reading version first. 
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